Background

The invention relates to fusion protein expression systems for use
in mammalian cells that enhance the production of a given target
protein. More specifically, the invention relates to a secretion
cassette, comprised of a mammalian signal peptide and a portion of
mammalian immunoglobulins, which, when used as the amino-terminal
fusion partner to the target protein, generally leads to high level
expression and secretion of the fusion product. Such fusion proteins
are useful, for example, for the production and extracellular
collection of target proteins without the need for lysis of a host
cell. The invention is perhaps most useful for the expression of target
proteins which are not normally secreted from a host cell, are secreted
at low levels from a host cell, or are toxic or otherwise deleterious
to a host cell.



Expression systems employing gene fusion constructs have been used 
to enhance the production of proteins in bacteria. Employing a 
bacterial protein that is normally expressed at a very high level as 
the amino-terminal fusion partner of a fusion protein helps to ensure 
efficient transcription and translation of the message, and in some 
cases the secretion and solubilization of the fusion protein (Smith and 
Johnson (1988) Gene 67:31; Hopp et al. (1988) Biotechnology 6:1204;
LaVallie et al. (1993) Biotechnology 11:187).



The major goal of expression of recombinant fusion proteins in
mammalian cells has been to confer novel properties to the hybrid
molecules, e.g., targeting of a cytokine or toxin in vivo, Fc receptor
binding, complement fixation, protein A binding, increasing the
half-life, and crossing the blood-brain barrier. Examples of recombinant
fusion proteins produced in mammalian cells include cytokine
immunoconjugates (Gillies et al. (1992) Proc. Natl. Acad. Sci. USA
89:1428; Gillies et al. (1993) Bioconjugate Chemistry 4:230),
immunoadhesins (Capon et al. (1989) Nature 337:525), immunotoxins
(Chaudhary et al. (1989) Nature 339:394), and a nerve growth factor
conjugate (Friden et al. (1993) Science 259:373). Each of the
foregoing publications is incorporated herein by reference. Proteins
produced in mammalian cells often do not have the solubility and
secretion problems encountered in bacterial expression. The use of
gene fusion constructs to enhance the production or secretion of a
target protein in a mammalian system has not been explored fully.

It is the object of the invention to provide DNAs which facilitate
the production and secretion of a target protein. In particular,
objects of the invention are to provide novel DNAs which facilitate
efficient production and secretion of hard to express proteins, such as
nuclear proteins, regulatory proteins and proteins which otherwise may
be toxic to a host cell, and which can be adapted to any target
polypeptide of interest which can be coded for and expressed in a host
organism; to provide DNA constructs for the rapid and efficient
production and secretion of proteins in a variety of host cells; and to
provide a method for the production, secretion and collection of
genetically engineered proteins, including non-native, biosynthetic, or
otherwise artificial proteins, such as proteins which have been created
by rational design. Other objects of the invention are to provide DNA
sequences which, when fused to a polynucleotide encoding a target
protein, encode a fusion polypeptide which can be purified using common
reagents and techniques, and to interpose a proteolytic cleavage site
between the encoded secretion cassette and the encoded target protein
such that the secretion cassette can be cleaved from the target protein
and the target protein can be purified independently. Still another
object is to provide a procedure which is both efficient and
inexpensive.

These and other objects of the invention will be apparent from the
description, drawings, and claims that follow.

Summary Of The Invention

The present invention features a DNA of general applicability for
production and secretion of fusion proteins. The DNA comprises a
secretion cassette, as the amino-terminal fusion partner, and a target
protein, and is termed herein an "immunofusin". The invention
provides, in its various aspects, a recombinant DNA encoding the
immunofusin, and methods of producing the encoded immunofusin protein.
The immunofusin is a DNA which comprises a polynucleotide encoding a
secretion cassette, comprising in its 5' to 3' direction a signal
sequence and an immunoglobulin Fc region, and a polynucleotide encoding
a target protein fused to the 3' end of the secretion cassette. A
secretion cassette of the invention, once constructed, can be fused to
various target proteins. Additionally, one can optimize the sequences
which regulate the expression of a secretion cassette, and hence the
expression of the immunofusin. The resultant DNA can be expressed at
high levels in a host cell, and the fusion protein is efficiently
produced and secreted from the host cell. The secreted immunofusin can
be collected from the culture media without the need for lysis of the
host cell, and can be assayed for activity or purified using common
reagents as desired.

The portion of the DNA encoding the signal sequence preferably
encodes a peptide segment which directs the secretion of the
immunofusin protein and is thereafter cleaved. As used in the
specification and claims, "immunoglobulin Fc region" means the
carboxyl-terminal portion of an immunoglobulin heavy chain constant
region. As is known, each immunoglobulin heavy chain constant region
is comprised of four or five domains. The domains are named
sequentially as follows: CH1-hinge-CH2-CH3(-CH4), and the Fc region of
each immunoglobulin subclass lacks at least the CH1 domain. As is
apparent from a review of the DNA sequences of the immunoglobulin
subclasses, the DNA sequences of the heavy chain domains have
cross-homology among the immunoglobulin classes, e.g., the CH2 domain
of IgG is homologous to the CH2 domain of IgA and IgD, and to the CH3
domain of IgM and IgE. The portion of the DNA encoding the
immunoglobulin Fc region preferably comprises at least a portion of a
hinge domain, and a CH3 domain of Fcγ or the homologous domains in any
of IgA, IgD, IgE, or IgM. The immunoglobulin Fc region also preferably
comprises at least a portion of the DNA encoding a hinge and a CH3
domain of Fcγ or the homologous domains in any of IgA, IgD, IgE or IgM.

The currently preferred secretion cassette is a polynucleotide
encoding, in its 5' to 3' direction, the signal sequence of an
immunoglobulin light chain gene and the Fcγ1 region of the human
immunoglobulin γ1 gene. The Fcγ1 region of the immunoglobulin γ1 gene
includes at least a portion of the hinge domain and CH3 domain, or at
least a portion of the hinge domain, CH2 domain and CH3 domain. The
DNA encoding the secretion cassette can be in its genomic configuration
or its cDNA configuration. However, the studies described below use a
secretion cassette in the genomic configuration. The use of human Fcγ1
as the Fc region sequence has several advantages. For example, if the
fusion protein is to be used as a biopharmaceutical, the Fcγ1 domain
may confer the effector function activities to the fusion protein. The
effector function activities include biological activities such as
complement fixation, antibody-directed cellular cytotoxicity, ability
for placental transfer, and a longer serum half-life. The Fc domain
also provides for detection by anti-Fc ELISA and purification through
binding to Staphylococcus aureus protein A ("Protein A"). In certain
applications it may be desirable to delete specific effector functions
from the Fc region, such as Fc receptor binding or complement fixation.

In another embodiment the Fc region can be a murine immunoglobulin
gene. The use of murine Fc as the Fc region can have advantages. For
example, if the fusion protein is to be used for the preparation of
proteins in mice, then the murine Fc region will not elicit an immune
response in the host animal. The Fc domain may confer the effector
function activities to the fusion protein, and allow for detection of
the fusion protein by anti-Fc ELISA and purification through binding to
Protein A. In certain applications it may be desirable to delete
specific effector functions from the Fc region.

In another embodiment the DNA sequence encodes a proteolytic
cleavage site interposed between the secretion cassette and the target
protein. A cleavage site provides for the proteolytic cleavage of the
encoded fusion protein, thus separating the Fc domain from the target
protein. As used herein, "proteolytic cleavage site" is understood to
mean the amino acid sequences which are cleaved by a proteolytic enzyme
or other proteolytic cleavage agents. As will be described in more
detail below, useful proteolytic cleavage sites include amino acid
sequences which are recognized by proteolytic enzymes such as trypsin,
plasmin or enterokinase K.

In a preferred embodiment the target protein sequence encodes
prostate specific membrane antigen, PSMA. PSMA is a type II membrane
protein, thus the extracellular domain, or soluble form of the protein,
is utilized as the target protein sequence. The encoded soluble form
of PSMA can be a human sequence such as the sequence provided in
Israeli et al. (1993) Cancer Res., 53:227-ff.

In another preferred embodiment the target protein sequence encodes
the protein gp120. The envelope protein gp120 of human
immunodeficiency virus is a glycoprotein which is expressed in infected
cells as a polyprotein, gp160, and then cleaved by a cellular protease
to gp120 and gp41. The nucleotide sequence and amino acid sequence of
gp120 are provided in Ratner et al., 1985, Nature, 313:277-ff.

In another aspect, the DNA sequence of the invention is integrated
within a replicable expression vector. As used herein, "vector" is
understood to mean any nucleic acid comprising a nucleotide sequence of
interest and competent to be incorporated into a host cell and to be
recombined with and integrated into the host cell genome, or to
replicate autonomously as an episome. Such vectors include linear
nucleic acids, plasmids, phagemids, cosmids and the like. A preferred
expression vector is pdC, in which the transcription of the immunofusin
DNA is placed under the control of the enhancer and promoter of the
human cytomegalovirus. The vector pdC was derived from pdEMp, which is
described in Lo et al., 1991, Biochim. Biophys. Acta 1088:712 (which
publication is incorporated herein by reference), as follows. The
SalI-XhoI fragment containing the original enhancer and promoter
sequence was replaced by the enhancer and promoter of the human
cytomegalovirus by standard molecular biology techniques. The enhancer
and promoter sequence of the human cytomegalovirus used was derived
from nucleotides -601 to +1 of the sequence provided in Boshart et al.,
1985, Cell 41:521, which is incorporated herein by reference. The
vector also contains the mutant dihydrofolate reductase gene as a
selection marker (Simonsen and Levinson (1983) Proc. Nat. Acad. Sci.
USA 80:2495, incorporated herein by reference).

An appropriate host cell can be transformed or transfected with the
DNA sequence of the invention, and utilized for the expression and
secretion of a target protein. Currently preferred host cells for use
in the invention include immortal hybridoma cells, myeloma cells, 293
cells, Chinese hamster ovary cells, HeLa cells, and COS cells. As used
herein, "gene expression" or "expression of a target protein" is
understood to refer to the transcription of the DNA sequence,
translation of the mRNA transcript, and secretion of the fusion protein
product.

The method of the invention involves providing a DNA sequence
encoding an immunofusin, transfecting the DNA sequence into a host cell
by an available transfection or transformation technique, culturing the
transfected host cell in a suitable medium under conditions which
promote the expression and secretion of the immunofusin, and collecting
the fusion protein from the extracellular media. When desired, the
target protein may be cleaved from the secretion cassette either before
or after it is collected from the extracellular media.

Other advantages and features of the invention will be apparent
from the description, drawings, and claims which follow.

Brief Description Of The Drawing

Figures 1A-D are a schematic illustration of an immunofusin.
Figure 1A, "DNA," illustrates the DNA encoding an immunofusin protein.
Figure 1B, "Fused Protein 1," illustrates the immunofusin protein prior
to cleavage of the signal sequence. Figure 1C, "Fused Protein 2,"
illustrates the immunofusin protein after cleavage of the signal
sequence. Figure 1D, "Target Protein," illustrates the target protein
portion of an immunofusin protein after cleavage of the immunofusin
protein at the cleavage site which is interposed between the Fc region
and the target protein.

Detailed Description

The present invention is a DNA comprising a polynucleotide
encoding, in the 5' to 3' direction, a signal sequence, an Fc region of
an immunoglobulin, and a target protein. This approach to the
expression and subsequent secretion of a target protein is superior to
the existing techniques because of the choice and the configuration of
the secretion cassette which is placed at the 5' end of the fusion
construct. Additionally, the regulatory sequences which direct the
expression of the secretion cassette can be optimized, and the
optimized secretion cassette can be paired with numerous target
proteins, thus allowing for the efficient production of numerous fusion
proteins.

The production of the immunofusin proteins is characterized as 
efficient and high level, because the target protein has been produced 
at the level of several micrograms/milliliter using the DNAs and 
methods according to the invention. Previously, workers in the art 
have rarely quantitated the expression levels of hard to express 
proteins due to the low levels of expression that are obtained in the 
known mammalian expression systems and the difficulties faced in 
quantitating proteins by techniques such as western blotting and RIA. 
Prior to the teachings of this invention, expression of hard to express
proteins at the microgram per milliliter level would often be attempted
using bacterial expression systems.

This invention is based on the concept that the ease of production
and collection of a target protein could be improved if the polypeptide
of interest were linked to an immunoglobulin Fc domain and the fusion
protein were expressed in a host cell, in particular a complementary
host cell which naturally expresses the immunoglobulin, such that the
fusion protein would be readily secreted from the host cell. In
addition to promoting the secretion of the fusion protein from the host
cell, the Fc region can further be exploited to aid in the purification
of the fused polypeptide. The general approach of the invention
involves the construction of recombinant DNA which encodes a fused
polypeptide and which, upon expression, results in expression of a
secretion cassette linked to a target protein, i.e., a protein of
interest having potential or demonstrable utility.

The overall structure of the preferred DNA of the invention, the
fusion protein it encodes, the form of the protein which is most often
secreted and the target protein product after enzymatic cleavage are
illustrated schematically in Figures 1A-D. Reference characters in the
DNA, Figure 1A, are carried over into the protein, Figures 1B-D, as
corresponding primed characters. The DNA which encodes the immunofusin
is shown between the start and the stop markers on the illustrated DNA
sequence, Figure 1A. Upstream regulatory elements are shown at the 5'
end of the DNA and are labeled "regulatory sequences". The DNA is
composed of three distinct polynucleotides which are linked together.
In Figure 1A, 3' of the regulatory sequences, which may be optimized
for each secretion cassette, is a first DNA 8 which encodes a secretion
cassette comprising two of the three polynucleotides: 1) a signal
sequence 10, and 2) an immunoglobulin Fcγ region 12. The
immunoglobulin Fcγ region is comprised of three subregions: 1) a hinge
region 14, 2) a CH2 region 16, and 3) a CH3 region 20. Attached to the
3' end of the DNA encoding the secretion cassette is the third
polynucleotide, a DNA encoding the target protein 24. Optionally, DNA
encoding a proteolytic cleavage site 22 can be interposed between the
DNA encoding the CH3 region of the immunoglobulin Fcγ region and the
DNA encoding the target protein.
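
The layout described above can be sketched as an ordered list of coding
elements. The following Python fragment is only an illustration: the
class and function names are hypothetical, and the only information
taken from the text is the 5' to 3' order of the elements, their
reference characters, and the convention that the protein products
carry the corresponding primed characters.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    name: str
    ref: int  # reference character used in Figure 1A


# Coding elements in 5' to 3' order. Elements 14, 16 and 20 together
# make up the Fc region 12; the cleavage site 22 is optional.
IMMUNOFUSIN_DNA = (
    Element("signal sequence", 10),
    Element("hinge region", 14),
    Element("CH2 region", 16),
    Element("CH3 region", 20),
    Element("proteolytic cleavage site", 22),
    Element("target protein", 24),
)


def primed(elements):
    """Label protein-level parts with primed reference characters."""
    return [f"{e.name} {e.ref}'" for e in elements]


def fused_protein_1(dna=IMMUNOFUSIN_DNA):
    """Translated product before the signal peptide is removed."""
    return primed(dna)


def fused_protein_2(dna=IMMUNOFUSIN_DNA):
    """Secreted product: the signal peptide 10' has been cleaved off."""
    return primed(e for e in dna if e.ref != 10)


def released_target(dna=IMMUNOFUSIN_DNA):
    """Product after cleavage at site 22': the target protein alone."""
    return primed(e for e in dna if e.ref == 24)
```

Swapping the last element for a different target protein leaves the
cassette elements untouched, which mirrors the statement that one
secretion cassette can be paired with numerous target proteins.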

The encoded fused protein comprises the secretion cassette 8' and
the target protein 24', shown as Fused Protein 1 in Figure 1B. Most
often the signal peptide 10' will be enzymatically cleaved from the
fusion protein by the host cell prior to the secretion of the
immunofusin, and thus Fused Protein 2, shown in Figure 1C, shows the
secreted fused protein which comprises the Fcγ region 12' fused to the
target polypeptide 24'. Both Fused Protein 1 and Fused Protein 2 show
the optional interposition of a proteolytic cleavage site 22' between
the CH3 domain 20' of the Fcγ region 12' and the target protein 24'.
Cleavage of either Fused Protein with the appropriate proteolytic agent
at the cleavage site 22' results in the release of the target protein
24' from the Fc region 12', as shown in Figure 1D.

The processes for manipulating, amplifying and recombining DNAs are
generally well known in the art, and therefore are not described in 
detail herein. Methods of identifying and isolating genes encoding 
proteins of interest, or for constructing such genes, are well 
understood and developed. In general the methods involve selecting 
genetic material coding for amino acids which define the polypeptide of 
interest according to the genetic code. 

Accordingly, the DNA construction principle disclosed herein can be
exploited using known recombinant DNA techniques involving the use of
various restriction enzymes which make sequence specific cuts in DNA to
produce blunt ends or cohesive ends, DNA ligase techniques enabling
enzymatic addition of sticky ends to blunt ended DNA, construction of
synthetic DNAs by assembly of short oligonucleotides, cDNA synthesis
techniques, polymerase chain reaction, and synthetic probes for
isolating genes having a particular function. Various promoter
sequences and other regulatory DNA sequences used in achieving
expression, and various types of host cells are also known and
available. Conventional transfection techniques, and equally
conventional techniques for cloning and subcloning DNA are useful in
the practice of this invention and known to those skilled in the art.
Various types of vectors may be used such as plasmids and viruses
including animal viruses. The vectors may exploit various marker genes
which impart to a successfully transfected cell a detectable phenotypic
property that can be used to identify which of a family of cells has
successfully incorporated the recombinant DNA of the vector. Given the
foregoing state of the genetic engineering art, skilled persons are
enabled to practice the invention disclosed herein in view of this
disclosure.



One method for obtaining the DNA encoding the various synthetic
linkers disclosed herein is by assembly of synthetic oligonucleotides
in a conventional, automated, polynucleotide synthesizer followed by
ligation with a ligase. For example, the linkers can be synthesized as
complementary DNA fragments using phosphoramidite chemistry.

The signal sequence of the invention is a polynucleotide which
encodes an amino acid sequence that initiates transport of a protein
across the membrane of the endoplasmic reticulum. Signal sequences
which will be useful in the invention include antibody light chain
signal sequences, e.g., antibody 14.18 (Gillies et al., 1989, Jour. of
Immunol. Meth., 125:191-202), antibody heavy chain signal sequences,
e.g., the MOPC141 antibody heavy chain signal sequence (Sakano et al.,
1980, Nature 286:5774), and any other signal sequences which are known
in the art (see, for example, Watson, 1984, Nucleic Acids Research
12:5145). Each of these references is incorporated herein by
reference. Signal sequences have been well characterized in the art
and are known typically to contain 16 to 30 amino acid residues, and
may contain greater or fewer amino acid residues. A typical signal
peptide consists of three regions: a basic N-terminal region, a central
hydrophobic region, and a more polar C-terminal region. The central
hydrophobic region contains 4 to 12 hydrophobic residues that anchor
the signal peptide across the membrane lipid bilayer during transport
of the nascent polypeptide. Following initiation, the signal peptide
is usually cleaved within the lumen of the endoplasmic reticulum by
cellular enzymes known as signal peptidases. Potential cleavage sites
of the signal peptide generally follow the "(-3, -1) rule". Thus a
typical signal peptide has small, neutral amino acid residues in
positions -1 and -3 and lacks proline residues in this region. The
signal peptidase will cleave such a signal peptide between the -1 and
+1 amino acids. Thus, the peptide encoded by the portion of the DNA
encoding the signal sequence may be cleaved from the amino-terminus of
the immunofusin protein during secretion. This results in the
secretion of an immunofusin protein consisting of the Fc region and the
target protein. A detailed discussion of signal peptide sequences is
provided by von Heijne (1986) Nucleic Acids Res., 14:4683 (incorporated
herein by reference). As would be apparent to one of skill in the art,
the suitability of a particular signal sequence for use in the
secretion cassette may require some routine experimentation. Such
experimentation will include determining the ability of the signal
sequence to direct the secretion of an immunofusin and also a
determination of the optimal configuration, genomic or cDNA, of the
sequence to be used in order to achieve efficient secretion of
immunofusins. Additionally, one skilled in the art is capable of
creating a synthetic signal peptide following the rules presented by
von Heijne, referenced above, and testing for the efficacy of such a
synthetic signal sequence by routine experimentation. A signal
sequence is also referred to as a "signal peptide", "leader sequence"
or "leader peptide", and each of these terms, having a meaning
synonymous with signal sequence, may be used herein.
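
The "(-3, -1) rule" described above lends itself to a simple screening
step when a synthetic signal peptide is being designed. The Python
sketch below is illustrative only: the set of residues treated as
"small, neutral" (Ala, Gly, Ser, Cys, Thr) is an assumption chosen for
the example, the function names are hypothetical, and the test sequence
is invented and is not a sequence from any cited reference. Passing
this check does not replace the routine experimentation noted above.

```python
# Assumption: residues treated as "small, neutral" for this illustration.
SMALL_NEUTRAL = frozenset("AGSCT")


def satisfies_minus3_minus1_rule(precursor, signal_length):
    """Check positions -3 and -1 relative to a proposed cleavage point.

    `signal_length` is the number of residues in the proposed signal
    peptide, so cleavage would fall between residue -1 (the last signal
    residue) and residue +1 (the first residue of the mature protein).
    """
    if signal_length < 3 or signal_length >= len(precursor):
        return False
    window = precursor[signal_length - 3:signal_length]  # -3, -2, -1
    return (
        window[0] in SMALL_NEUTRAL      # position -3
        and window[2] in SMALL_NEUTRAL  # position -1
        and "P" not in window           # no proline in this region
    )


def candidate_cleavage_sites(precursor, min_len=16, max_len=30):
    """Signal peptide lengths in the typical 16 to 30 residue range
    whose C-terminal end satisfies the rule."""
    upper = min(max_len, len(precursor) - 1)
    return [n for n in range(min_len, upper + 1)
            if satisfies_minus3_minus1_rule(precursor, n)]


# Invented precursor: basic N-terminus, hydrophobic core, "ASA" at
# positions -3 to -1, followed by the start of a mature protein.
HYPOTHETICAL_PRECURSOR = "MKK" + "L" * 12 + "ASA" + "DIQM"
```

For the invented precursor above, the only length that passes is 18,
i.e. cleavage directly after the "ASA" tripeptide.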

The Fc region of an immunoglobulin is the amino acid sequence for
the carboxyl-terminal portion of an immunoglobulin heavy chain constant
region. The Fc regions are particularly important in determining the
biological functions of the immunoglobulin and these biological
functions are termed effector functions. As is known, the heavy chains
of the immunoglobulin subclasses comprise four or five domains: IgM and
IgE have five heavy chain domains, and IgA, IgD and IgG have four heavy
chain domains. The Fc region of IgA, IgD and IgG is a dimer of the
hinge-CH2-CH3 domains, and in IgM and IgE it is a dimer of the
hinge-CH2-CH3-CH4 domains. Further, the CH3 domain of IgM and IgE is
structurally equivalent to the CH2 domain of IgG, and the CH4 domain of
IgM and IgE is the homolog of the CH3 domain of IgG (see W.E. Paul,
ed., 1993, Fundamental Immunology, Raven Press, New York, New York,
which publication is incorporated herein by reference). Any of the
known Fc regions would be useful as the Fc region of the secretion
cassette. However, it is important that the binding sites for certain
proteins be deleted from the Fc region during the construction of the
secretion cassette. For example, since coexpression with the light
chain is unnecessary, the binding site for the heavy chain binding
protein, BiP (Hendershot et al. (1987) Immunol. Today 8:111-114),
should be deleted from the CH2 domain of the Fc region of IgE, such
that this site does not interfere with the efficient secretion of the
immunofusin. Likewise, the cysteine residues present in the Fc regions
which are responsible for binding to the light chain of the
immunoglobulin should be deleted or substituted with another amino
acid, such that these cysteine residues do not interfere with the
proper folding of the Fc region when it is produced as an immunofusin.
In the same manner, transmembrane domain sequences, such as those
present in IgM, should be deleted such that these sequences do not
result in misdirecting the immunofusin to the membrane as a
transmembrane protein.
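
The domain correspondence set out above can be written down as a small
lookup, which may be convenient when an Fc region from a class other
than IgG is being considered. This Python fragment is a sketch under
the stated correspondences only; the names are hypothetical and nothing
here reflects sequence-level data.

```python
# Fc domain composition by class, as described in the text.
FC_DOMAINS = {
    "IgA": ("hinge", "CH2", "CH3"),
    "IgD": ("hinge", "CH2", "CH3"),
    "IgG": ("hinge", "CH2", "CH3"),
    "IgM": ("hinge", "CH2", "CH3", "CH4"),
    "IgE": ("hinge", "CH2", "CH3", "CH4"),
}

# In IgM and IgE the homologs of the IgG CH2 and CH3 domains are
# shifted by one position.
_SHIFTED = {"CH2": "CH3", "CH3": "CH4"}


def homolog_of_igg_domain(igg_domain, ig_class):
    """Return the domain of `ig_class` that corresponds to an IgG
    CH2 or CH3 domain."""
    if ig_class not in FC_DOMAINS:
        raise ValueError(f"unknown immunoglobulin class: {ig_class}")
    if igg_domain not in _SHIFTED:
        raise ValueError("only CH2 and CH3 are mapped in this sketch")
    if len(FC_DOMAINS[ig_class]) == 4:  # IgM and IgE
        return _SHIFTED[igg_domain]
    return igg_domain
```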

Upon expression and production of the Fc region as a portion of the
secretion cassette, it may retain some of the biological properties,
termed "effector functions", which are native to the particular
immunoglobulin class from which the Fc region is obtained. Useful
effector functions include, for example, complement fixation, Fc
receptor binding, binding to cell membranes, and placental transfer.
In some cases, it may be advantageous to modify or remove one or more
of these effector functions, such as Fc receptor binding or complement
fixation, using site directed mutagenesis or other well known molecular
biology techniques. For example, Duncan et al. (Nature, 1988, 332:738)
have mapped the amino acids responsible for several of the
immunoglobulin gamma effector function activities. See also Duncan
et al., Nature, 1988, 332:563; Yasmeen et al., J. Immunol., 1976,
116:518; Tao et al., J. Immunol., 1989, 143:2595. Each of these
publications is incorporated herein by reference. The amino acids or
peptide segments responsible for these functions can be deleted, thus
removing that portion of the Fc region, or substituted with sequences
which would not confer the function, using well known molecular biology
techniques.

The currently preferred class of immunoglobulin from which the Fc
region is derived is immunoglobulin gamma-1, because it has been well
characterized and is efficiently secreted from most cell types. The Fc
region of the other subclasses of immunoglobulin gamma (gamma-2,
gamma-3 and gamma-4) would function equally well in the secretion
cassette. The Fc region of immunoglobulin gamma-1 that is preferably
used in the secretion cassette includes at least part of the hinge
region, CH2 region, and CH3 region. In addition, the Fc region of
immunoglobulin gamma-1 can be a CH2-deleted-Fc, which includes a part
of a hinge region and a CH3 region wherein the CH2 region has been
deleted. A CH2-deleted-Fc has been described by Gillies et al., 1990,
Hum. Antibod. Hybridomas, 1:47, which publication is incorporated
herein by reference.

As is apparent from the above discussion of Fc regions, the Fc 
regions from the other classes of immunoglobulins, IgA, IgD, IgE, and
IgM, would also be useful as the Fc region of the secretion cassette.
Further, deletion constructs of these Fc regions, in which one or more 
of the constant domains are deleted would also be useful. One of 
ordinary skill in the art could prepare such deletion constructs using 
well known molecular biology techniques. 

The identity of the target protein produced in accordance with the
invention is essentially unlimited. Indeed, an important feature of
the invention is that it provides a generalized DNA construct and
procedure which can be adapted to facilitate recombinant production of
any desired target protein. For instance, the application of the
invention to the expression of regulatory proteins, such as
transcription factors which are normally localized to the nucleus,
allows for the efficient secretion of such normally non-secreted
proteins. In addition, regulatory proteins are in general difficult to
express and the purification procedures are generally cumbersome (see,
for example, Meisterernst et al. (1991) Cell 66:981). Therefore, it is
especially desirable that such proteins be exported into the culture
medium. Additionally, the invention can be used to enhance the
production and secretion of proteins which are normally secreted at low
levels. If a desired target protein includes sequences encoding a
secretion signal or a transmembrane signal, these sequences can be
removed from the target protein such that the secretion cassette
directs the secretion of the fusion protein.

The optional proteolytic cleavage site may be any amino acid
sequence which is recognized by specific cleavage agents. The
specificity of cleavage agents is determined by the identity of the
sequence of amino acids at or near the peptide bond which is to be
hydrolyzed. A given cleavage agent may recognize the bond between two
specific amino acids or may recognize a bond following one or a
specific sequence of amino acids. The specificity of many cleavage
agents is known. Table 1 set forth below lists various known cleavage
agents and their primary (and in some cases secondary) sites of action.



TABLE 1

Cleavage Agent                    Major Site of Action          Other Sites of Action

Trypsin                           Arg, Lys
Chymotrypsin                      Trp, Phe, Tyr                 Leu, Met, His
Elastase                          Neutral Aliphatic Residues
Pepsin                            Phe, Leu, Trp                 Ala, Gly, Glu
Papain                            Arg, Lys, Gly                 Wide specificity
Subtilisin                        Aromatic and                  Various
                                  Aliphatic residues
Thermolysin                       Amino-linked bonds of
                                  Aliphatic Residues
S. aureus protease                Glu
Endoproteinase Arg C              Arg
(Submaxillaris protease)
Clostripain                       Arg
Thrombin                          Arg
Collagenase                       X-Gly-Pro                     X-Ala-Pro, X-Gly-Thr
Lysobacter enzymogenes            Lys
(endoproteinase Lys-C)
Myxobacter AL-1                   Lys
Armillaria mellea                 Lys
Flavobacterium meningosepticum    Pro
Factor Xa                         Ile-Glu-Gly-Arg
CNBr                              Met
BNPS-skatole                      Trp
N-bromosuccinimide                Trp
o-Iodosobenzoic acid              Trp
HBr/DMSO                          Trp
NTCB                              Cys
Sodium metal in liquid ammonia    Pro
Hydroxylamine                     Asn-Gly
Dilute acid                       Asp-Pro

Other cleavage agents are known. Those preferred for use in the 
invention are enzymes with a primary site of action which cleave at the 
C-terminal side of the cleavage site residue. 

The cleavage site in the fused protein generally can comprise any 
one or sequence of amino acids which can be cleaved by a cleavage agent 
specific for the site in an appropriate environment. Specificity of 
cleavage can be increased, and likelihood of undesired cleavage within 
the target protein or elsewhere in the fused polypeptide can be 
decreased, by selecting the cleavage agent having a site of action 
which is absent from the target polypeptide. The fused polypeptide is 
preferably cleaved under conditions in which it has assumed its native 
conformation. This has the effect of masking the presence of potential 
cleavage sites in the target polypeptide. 
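
The selection rule described above (prefer a cleavage agent whose site of action is absent from the target polypeptide) can be sketched in a few lines of Python. This is an illustration only and not part of the disclosed method: the `SITES` dictionary holds a handful of primary sites from Table 1 plus the enterokinase site named in Example 1, the function name `agents_sparing` and the toy target sequence are hypothetical, and the check looks only at primary sequence, so it ignores the masking of sites by native conformation.

```python
# Primary recognition sites in one-letter amino acid code.
# Entries are a small subset of Table 1; enterokinase comes from Example 1.
SITES = {
    "trypsin": ["R", "K"],
    "S. aureus protease": ["E"],
    "CNBr": ["M"],
    "Factor Xa": ["IEGR"],
    "enterokinase": ["DDDDK"],
    "hydroxylamine": ["NG"],
    "dilute acid": ["DP"],
}


def agents_sparing(target: str) -> list[str]:
    """Return the agents whose primary site does not occur in the target."""
    target = target.upper()
    return sorted(
        agent
        for agent, motifs in SITES.items()
        if not any(motif in target for motif in motifs)
    )


# Hypothetical target fragment, used only to exercise the function.
toy_target = "GSAKTLQPVF"
print(agents_sparing(toy_target))
```

With this toy fragment only trypsin is ruled out, because the fragment contains a lysine. A real screen would use the full target sequence and would still need experimental confirmation.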

The invention is illustrated further by the following non-limiting
examples.

Example 1. Construction of a Secretion Cassette



The construction of an exemplary secretion cassette is described
below. As would be appreciated by those of ordinary skill in the art,
the signal sequence and the Fc region of an immunoglobulin could be
sequences other than those described.

The signal sequence of an immunoglobulin light chain of the 14.18
antibody was selected for use as the signal sequence of the secretion
cassette. The sequence of the 14.18 antibody light chain is provided
in Gillies et al., 1989, Jour. Immunol. Meth., 125:191-202 and is
incorporated herein by reference. The signal sequence was modified for
ease of cloning as an XbaI-AflII fragment of the DNA. As would be
apparent to those of skill in the art, the DNA encoding a human signal
sequence could also be used. Specifically, an XbaI site was introduced
5' of the translation initiation codon and the consensus sequence for
optimal ribosome binding (Kozak, 1984, Nature 308:241, incorporated
herein by reference). An AflII site was introduced into the 3' end of
the signal sequence by mutagenizing the DNA coding for the penultimate
amino acid residue of the signal peptide from a serine to a leucine;
thus the sequence ATC was mutagenized to TTA using site directed
mutagenesis.

The Fc region of an immunoglobulin was selected to be the human
Fcγ1 genomic DNA, including the genomic configuration of the hinge, CH2
and CH3 domains. The genomic sequence of human Fcγ1 is provided in
Huck et al. (1986) Nucleic Acids Res. 14:1779 and is incorporated
herein by reference. As would be apparent to one of ordinary skill in
the art, a CH2-deleted Fc may also be used as the Fc region of the
secretion cassette (see Gillies et al., 1990, Hum. Antibod.
Hybridomas, 1:47), in which case the CH2 domain would be deleted from
the Fc region using established molecular biology techniques during the
construction of the secretion cassette. The genomic DNA of Fcγ1 was
modified for ease of cloning as an AflII-XmaI fragment. The 5' end of
the human Fc genomic DNA was mutagenized to an AflII site by performing
a Polymerase Chain Reaction (PCR) using a 5' sense primer with the
following sequence (Sequence ID No. 1):

r,&r,RATT rTTAAG CGAGCCCAAATCTTCTGACAAAACTCAC 

This primer introduced an AflII site (underlined) and a cysteine to
serine mutation (TGT to TCT, bold). The cysteine being mutated is the
one that is normally involved in disulphide bonding with the light
chain, and thus the mutation does not affect the effector functions of
the Fc region. The deletion of this cysteine may serve to enhance the
production of the Fcγ1 region, as the efficient production of this
modified Fcγ1 region will not require the coexpression of the
immunoglobulin light chain. This cysteine was also removed so that it
does not interfere with the proper folding of the Fcγ1 region or the
fused target protein. The 3' end of the Fcγ1 genomic DNA encodes two
XmaI restriction sites. They are located 10 and 280 bp upstream of the
translation stop codon in the CH3 domain. The distal XmaI site was
destroyed by introducing a silent mutation, using site directed
mutagenesis (TCC to TCA, where the CC were the first two bases of the
XmaI site), so that the XmaI site 10 bp upstream of the stop codon
became unique.
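
The two mutations described above can be checked with a short Python sketch. It is a minimal illustration under stated assumptions: it uses the standard genetic code, it uses only the legible 3' portion of the Sequence ID No. 1 primer and assumes that portion begins with an intact AflII site (CTTAAG), and the wild-type hinge codons are obtained simply by reverting the stated TCT back to TGT. The helper and variable names are hypothetical.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, built in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate the complete codons of a DNA string, starting at position 0."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


XMAI = "CCCGGG"
AFLII = "CTTAAG"

# Legible 3' portion of the primer, assumed to start at the AflII site.
primer_tail = "CTTAAGCGAGCCCAAATCTTCTGACAAAACTCAC"
hinge_mut = primer_tail[len(AFLII) + 1:]             # GAG CCC AAA TCT TCT ...
hinge_wt = hinge_mut[:12] + "TGT" + hinge_mut[15:]   # revert TCT to TGT

print(translate(hinge_wt))   # EPKSCDKTH (cysteine present)
print(translate(hinge_mut))  # EPKSSDKTH (cysteine replaced by serine)

# Silent change that removes the distal XmaI site: TCC -> TCA.
# Only the codon and the site itself are taken from the text.
before = "TCC" + XMAI[2:]    # TCCCGGG, contains CCCGGG
after = "TCA" + XMAI[2:]     # TCACGGG, site destroyed
print(XMAI in before, XMAI in after, translate("TCC") == translate("TCA"))
```

The last line prints `True False True`: the site is present before the change, absent after it, and the encoded residue (serine) is unchanged.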

The XbaI-AflII restriction fragment encoding the light chain signal
peptide was then ligated to the AflII-XmaI restriction fragment
encoding the Fc region. The resultant XbaI-XmaI restriction fragment
therefore encodes the secretion cassette, and the gene encoding the
target protein of interest can be ligated to the 3' end of the
secretion cassette via the XmaI site.

In general, the DNA encoding the target protein can be ligated to
the unique XmaI site through the use of a linker-adaptor. Such a
linker-adaptor may also include restriction endonuclease sites in
addition to an XmaI site. The use of a linker-adaptor has the
additional feature that it can encode a proteolytic cleavage site
for subsequent use in cleaving the target protein from the secretion
cassette after production and secretion of the fusion protein. For
example, the linker-adaptor can encode a lysine residue at the junction
of the fusion protein, which provides the option of cleaving the target
protein from the Fc domain by proteolytic enzymes such as trypsin or
plasmin. Similarly, the linker-adaptor can include a DNA encoding the
cleavage site of enterokinase K (Asp-Asp-Asp-Asp-Lys) in order to
provide for the specific cleavage of the secreted fusion protein by
enterokinase K.

Example 2. Construction of an Immunofusin

The construction of an exemplary immunofusin, including a secretion
cassette and a target protein, is described below. As would be apparent
to those of ordinary skill in the art, other target proteins can be
fused to a secretion cassette using the same or other molecular cloning
techniques.

The target protein for the exemplary immunofusin was chosen to be
CD26, which is a type II membrane protein having its active site within
the carboxyl-terminal region of the protein, which is the extracellular
domain. During the construction of a CD26 immunofusin, the cytoplasmic
and transmembrane domains of CD26 were deleted so that they would not
interfere with the secretion of the immunofusin by the secretion
cassette. The 5' end of the cDNA encoding the extracellular domain was
modified for ease of cloning to include an XmaI site, which was
introduced via a linker-adaptor. The 3' end of the CD26 cDNA was also
modified for ease of cloning to include an XhoI site, which could be
introduced downstream of the translation stop codon either by PCR or by
linker-adaptor ligation.

Various linker-adaptors can be used depending upon the desire for
introduction of a proteolytic cleavage site between the DNA encoding
the Fc region and the CD26 cDNA. For example, one linker-adaptor
which can be used for CD26 is:

5' CCG GGT (AAA) GGC ACA GAT GAT GCT ACA G
3'      CA (TTT) CCG TGT CTA CTA CGA TGT C

as provided in Sequence ID Nos. 2 and 3. The first three codons in the
top strand encode the last three amino acid residues of the CH3 domain,
and starting with the codon GGC is the gene sequence of the
extracellular domain of CD26. This linker-adaptor had the cohesive end
of an XmaI site at its 5' end and the blunt end of a PvuII site at its
3' end, the blunt ended PvuII site being a convenient site for
reconstruction with the rest of the CD26 cDNA. The lysine codon (AAA,
in parentheses) in the linker-adaptor is but one of many optional amino
acid sequences which are useful to provide a proteolytic cleavage
site for cleavage agents. For example, the fusion protein can be
cleaved at this lysine residue by enzymes such as trypsin or plasmin.

Alternatively, for more specific proteolytic cleavage by
enterokinase K, the gene sequence encoding the enterokinase K cleavage
site can be introduced via the following linker-adaptor:

5' CCG GGT TCA GGG GAT GAC GAT GAC GAT A
3'      CA AGT CCC CTA CTG CTA CTG CTA TTC GA

as provided in Seq. ID Nos. 4 and 5. The nucleotides in bold encode
the amino acid residues (Asp)4-Lys, which is the recognition site of
enterokinase K. The linker-adaptor ends with a HindIII site, to which
the CD26 gene or other target protein gene sequences can be joined.
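
A linker-adaptor of this kind can be sanity-checked in software before the oligonucleotides are ordered. The sketch below is an illustration, not part of the disclosed procedure: it confirms that the two strands of the enterokinase linker-adaptor (Seq. ID Nos. 4 and 5) base-pair, reports the unpaired ends, and translates the coding strand once the right-hand overhang is filled in. The function names `duplex_overhangs` and `translate` are hypothetical, and the standard genetic code is assumed.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate the complete codons of a DNA string, starting at position 0."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


def duplex_overhangs(top_5to3: str, bottom_3to5: str, top_offset: int):
    """Check pairing of two annealed oligos written left to right as printed.

    top_offset is the number of unpaired bases at the left end of the top
    strand. Returns (left, right) unpaired ends; when the bottom strand is
    the longer one, the right end is given as written, that is 3'->5'.
    """
    top = top_5to3.replace(" ", "").upper()
    bottom = bottom_3to5.replace(" ", "").upper()
    paired_top = top[top_offset:]
    n = min(len(paired_top), len(bottom))
    if paired_top[:n].translate(COMPLEMENT) != bottom[:n]:
        raise ValueError("strands are not complementary in the paired region")
    return top[:top_offset], bottom[n:] or paired_top[n:]


top = "CCGGGTTCAGGGGATGACGATGACGATA"
left, right = duplex_overhangs(
    top, "CA AGT CCC CTA CTG CTA CTG CTA TTC GA", top_offset=4
)
print(left)          # CCGG, the XmaI-compatible 5' overhang
print(right[::-1])   # AGCT, the HindIII-compatible overhang read 5'->3'

# Coding strand after the right-hand overhang is filled in.
coding = top + right.translate(COMPLEMENT)
print(translate(coding))
```

The translated junction contains the Asp-Asp-Asp-Asp-Lys motif immediately before the HindIII site, which is where the target gene would be joined.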

Example 3. Host Cells and Transfection

The preferred host cell lines include the mouse myeloma (or
hybridoma) NS/0 and Sp2/0 Ag14 cells. The myeloma cells were
transfected by protoplast fusion and selected in Dulbecco's modified
Eagle's medium (Gibco) containing 10% fetal bovine serum and 100 nM
methotrexate, as described by Gillies et al., 1989, BioTechnology,
7:799, which publication is incorporated herein by reference.
Transfectants secreting the immunofusins were identified by anti-Fc
ELISA, as described by Gillies et al. (1989) J. Immunol. Methods
125:191, which publication is incorporated herein by reference. The
highest producers were adapted to media containing 1 µM MTX and
subcloned by limiting dilutions. For the production of immunofusins,
the cells were grown in Hybridoma Serum-Free Media (HSFM, Gibco)
containing 1% fetal bovine serum and 1 nM MTX.

The other preferred recipient cell line is the human kidney 293
cell line, which is useful for both transient and stable expression.
Other cells, such as HeLa and Chinese hamster ovary (CHO) cells, also
worked in our system. The preferred method of transfection for these
adherent cells is coprecipitation of plasmid DNA with calcium
phosphate; other methods include lipofection and electroporation.
For a description of these methods and other useful transfection
methods see Sambrook et al. (1989) Molecular Cloning: A Laboratory
Manual, Cold Spring Harbor, NY, incorporated herein by reference.

Example 4. Characterization and Purification of Immunofusins

For routine characterization by gel electrophoresis, immunofusins
in the conditioned media were first captured on Protein A Sepharose
(Repligen, Cambridge, MA) and then eluted by boiling in protein sample
buffer with or without 2-mercaptoethanol. After electrophoresis on an
SDS-gel, the protein bands were visualized by Coomassie staining. For
example, the IL2 immunofusin (see Example 5) gave a band having a
molecular weight of 45 kD under reducing conditions and a band having
a molecular weight of 90 kD under non-reducing conditions, showing
that the IL2 immunofusin was produced as a dimer, presumably through
disulphide bonding in the hinge domain of the Fc region.

For purification, the cell culture media was collected and the
immunofusins were bound on Protein A Sepharose. The immunofusins were
subsequently eluted from the Protein A in a sodium citrate buffer (100
mM, pH 4). The eluate was then immediately neutralized with 0.1 volume
of 1 M Tris-hydrochloride, pH 8. In the case of CD26 immunofusin, it
was shown that such an elution procedure resulted in greater than 80%
recovery of the CD26 immunofusin with no loss of enzyme activity.

Example 5. Expression of IL2 Immunofusin

The cDNA of mature IL2 protein was modified for ease of cloning to
have a 5' XmaI restriction endonuclease site and a 3' XhoI restriction
endonuclease site using well known molecular techniques, such as those
described in Example 2. The sequence of the mature IL2 cDNA is
provided in Taniguchi et al., 1983, Nature, 302:305 and is
incorporated herein by reference. The cDNA of the mature IL2 protein
was constructed using recombinant techniques as a synthetic gene in
order to optimize codon usage and to introduce desirable restriction
endonuclease cleavage sites. The synthetic gene was created using
conventional DNA manipulation techniques. Once the synthetic IL2 cDNA
was constructed, the 5' XmaI site of the IL2 cDNA was ligated to the 3'
XmaI site of the secretion cassette, described in Example 1. The IL2
immunofusin was then cloned into the expression vector pdC. The IL2
immunofusin expression vector was transfected into NS/0 and Sp2/0 as
host cells by protoplast fusion, as is described by Gillies et al.,
1989, Biotechnology, 7:799.

Two to three weeks after transfection, MTX-resistant NS/0 and Sp2/0
clones appeared. The initial clones were screened by anti-Fc ELISA.
The IL2 immunofusin protein was collected from the media. An
appropriate assay for the biological activity of IL2 was the standard
T-cell proliferation assay according to Gillies et al. (Proc. Natl.
Acad. Sci. (1992) 89:1428), which is incorporated herein by reference.
The spent culture of the best clone contained about 100 µg/ml of IL2
immunofusin. The host cell clones which efficiently produced and
secreted the IL2 immunofusin protein were subcloned in media containing
100 nM MTX, and the best subclone produced about 200 µg/ml of protein
in spent culture. When MTX was left out of the media in the
subcloning, the best subclone thus isolated produced about 180 µg/ml in
spent culture. Thus, the construction of an IL2 immunofusin
unexpectedly provided for the production of IL2 at a level which is
about 80 times that which can be achieved by the expression of IL2
alone using the pdEMp vector (unpublished data), and many times that
of the IL2 that was expressed in mammalian cells (Conradt et al., 1989,
J. Biol. Chem., 264:17368) and in yeast (Ernst et al., 1989,
Biotechnology, 7:716). As mentioned in Example 4, IL2 immunofusin was
produced as a homo-dimer of molecular weight of 90 kD, presumably
through disulphide bonding in the hinge domain of the 45 kD monomers.



Example 6. Expression of CD26 Immunofusin

The construction of CD26 as an immunofusin was undertaken to
demonstrate that the invention is applicable to the expression of
membrane anchored proteins such as type II membrane proteins. A type
II membrane protein displays the carboxyl-terminal domain on the
extracellular surface, and most often includes its active region within
this carboxyl-terminal domain. The joining of a fusion polypeptide to
the carboxyl-terminal region of such a protein may interfere with the
proper folding of the active site, and thus reduce or prevent the
production of active protein.

CD26 is a type II membrane protein comprising 766 amino acid
residues. The biological function of CD26 is as a T cell activation
antigen and the putative coreceptor for entry of HIV in CD4+ cells
(Callebaut et al. (1993) Science 262:2045). The CD26 protein is
anchored to the lipid bilayer of the plasma membrane through a
hydrophobic domain between residues 7 and 28 at the N-terminus. Amino
acids 1 to 6 form a short cytoplasmic tail. The rest of the protein,
between residues 29 and 766, is extracellular and includes several
potential N-glycosylation sites and the active site of the enzyme
(Tanaka et al. (1992) J. Immunol. 149:481). The 738 carboxyl-terminal
residues in CD26 protrude from the membrane surface and the C-terminus
is free. A soluble CD26 expressed as an immunoadhesin will have a
conformation different from that of the native CD26, because the
carboxyl-terminus in an immunoadhesin CD26 protein is not free but
connected to antibody sequence. On the other hand, if we engineer an
immunofusin in which the antibody sequence is amino-terminal to the
target protein, such as CD26, the native conformation of CD26 will be
preserved, i.e. the C-terminus is free, and the antibody sequence,
herein an Fc region, takes the place of the membrane to which CD26 is
normally anchored. The enzymatic and biological activities of such a
soluble CD26 immunofusin will not be compromised. In addition, CD26 is
a protease and its expression may be deleterious to the host cell.
Thus, by efficiently exporting the CD26 protease outside of the host
cell in the form of an immunofusin, a higher level of expression can be
achieved.
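
The residue boundaries given above can be tabulated to see how much of CD26 is carried into the immunofusin. The sketch below is illustrative only; the dictionary and function names are hypothetical, and the boundaries are the ones stated in the text (1-based, inclusive).

```python
# Residue boundaries of CD26 as stated in the text (1-based, inclusive).
CD26_DOMAINS = {
    "cytoplasmic tail": (1, 6),
    "transmembrane anchor": (7, 28),
    "extracellular domain": (29, 766),
}


def domain_lengths(domains):
    """Return the number of residues in each inclusive (start, end) range."""
    return {name: end - start + 1 for name, (start, end) in domains.items()}


lengths = domain_lengths(CD26_DOMAINS)
for name, n in lengths.items():
    print(f"{name}: {n} residues")

# The three ranges should tile the full 766-residue protein with no gaps.
assert sum(lengths.values()) == 766
```

Only the extracellular range is retained in the CD26 immunofusin; the first 28 residues are the ones deleted in Example 2.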

A 2.3 kb cDNA fragment encoding the extracellular domain of CD26
was used to construct the CD26 immunofusin expression vector. The DNA
sequence of CD26 is provided in Tanaka et al., 1992, J. Immunol.,
149:481 and is incorporated herein by reference. CD26 was fused 3' of
the secretion cassette as described above in Example 2, and then the
secretion cassette and CD26 target protein were cloned into the
expression vector pdC using the XbaI restriction endonuclease site 5'
of the light chain signal sequence and the XhoI restriction
endonuclease site 3' of the CD26 protein as described in Example 2
above. The resultant CD26 immunofusin expression vector was
transfected into a host cell as described in Example 3 above. MTX-
resistant clones from transfected NS/0 and Sp2/0 cells were screened by
anti-Fc ELISA and DPPIV activity assay. CD26 is also known as DPPIV,
which is an exopeptidase that cleaves after amino-terminal X-P (X can
be any amino acid residue, and P is proline). DPPIV enzyme activity of
the CD26 immunofusin was assayed according to Tanaka et al., Proc.
Natl. Acad. Sci., 1993, 90:4566, incorporated herein by reference,
using glycylproline p-nitroanilide tosylate (Gly-Pro-pNA) as a
substrate. The best NS/0 clone produced about 3.5 µg/ml of CD26
immunofusin. The DPPIV moiety of the protein product was determined to
be fully active, having Km and kcat values similar to those of the
native CD26. Furthermore, the enzymatic activity of CD26 immunofusin
was inhibited by known peptide inhibitors in a dose-dependent manner.
The peptide inhibitors tested included the tripeptides IPI, VPL and
APL, each of which inhibited the CD26 enzyme activity greater than 30%
at 0.15 mM, greater than 70% at 1 mM and greater than 90% at 4 mM. As
a control, known non-inhibitor peptides were also tested for their
effect upon CD26 enzyme activity, and the known non-inhibitors, GGG and
GPHyP (wherein HyP is hydroxyproline), were found to have no effect on
the CD26 activity when incubated with the CD26 immunofusin at
concentrations ranging between 0.01 mM and 11 mM.

Example 7. Expression of Tat Immunofusin

The invention was also applied to the expression of regulatory
proteins which are normally localized to the nucleus. Because
regulatory proteins are in general difficult to express and purify, it
is especially desirable to devise a method by which such proteins can
be efficiently secreted from a host cell. Immunofusin constructs of
Tat and Rev (described in Example 8), which are two proteins encoded by
the human immunodeficiency virus (HIV) that regulate expression of
viral proteins in the cell nucleus, were made in order to determine the
efficiency with which these proteins can be expressed and collected.
We obtained high level expression and secretion of the Tat and Rev
immunofusins, and readily purified the immunofusins in a single step.

A 260 base-pair cDNA fragment encoding Tat was cloned into the XmaI
and XhoI sites of the pdC expression vector by modification of the 5'
and 3' ends of the Tat cDNA using recombinant DNA techniques as
described above. The sequence of the cDNA encoding the Tat protein is
provided in Ratner et al., 1985, Nature, 313:277, and is incorporated
herein by reference. Specifically, the sequence at the 5' end was
modified to C CCG GG T CGC ATG GAG ... (Seq. ID No. 6), where the
underlined sequence is the XmaI site and the ATG in bold is the
translation start codon of the Tat gene. At the 3' end, an XhoI site
was introduced immediately downstream of the translation stop codon by
standard PCR techniques. The Tat immunofusin expression vector was
then transfected into a host cell, as described above, and the host
cells were analyzed for production of Tat immunofusin protein. High
level expression was obtained in transiently transfected 293 cells and
stably transfected NS/0 cells. Stable NS/0 clones produced about 3
µg/ml of a 48 kD protein, analyzed on an SDS-gel under reducing
conditions. This protein was confirmed to be Tat immunofusin by an
anti-Tat antibody (Cat. #7001, American BioTechnologies, Cambridge,
MA).

The Tat immunofusin was shown to be active by the following
transient expression experiment in 293 cells, the results of which are
presented below in Table 2. The expression vector for Tat immunofusin
was cotransfected with a separate vector containing LTR-TAR-Kappa,
where LTR-TAR is the long terminal repeat DNA sequence of HIV that is
transactivated by the Tat protein, and Kappa is the gene sequence
encoding the Kappa light chain of immunoglobulin. To measure
expression levels of Fc-Tat (Tat immunofusin) and Kappa, the
supernatants were assayed by anti-Fc and anti-Kappa ELISA respectively.
In Table 2, pdC-Fc-Tat represents the pdC expression vector for Tat
immunofusin; LTR-TAR-Kappa represents the expression vector for Kappa
light chain, in which the LTR-TAR regulatory region can be
transactivated by Tat; and pCEP-Tat is an expression vector for Tat,
whose transcription is under the control of the human cytomegalovirus
enhancer and promoter. pCEP-Tat was used as a positive control to
monitor the transactivation of the LTR-TAR-Kappa by Tat protein. As a
negative control, LTR-TAR-Kappa was transfected alone to demonstrate
that it is not transactivated in the absence of Tat protein or Tat
immunofusin. As shown in Table 2, high level expression of the Tat
immunofusin was observed in transfection 1; high level expression of
both Tat and Kappa light chain was observed in the cotransfection
experiment, transfection 2. Transactivation of Kappa by Tat was seen
in the positive control, transfection 3, as expected. Little or no
expression of Kappa was seen in the negative control, transfection 4,
also as expected. Therefore, the Kappa light chain is expressed only
through transactivation of the LTR-TAR region by a functional Tat
protein, and the Tat immunofusin provides a functional Tat protein
which is readily secreted from the host cell. This result also
demonstrates that the secretion cassette is able to direct the
secretion of a protein which is normally transported to the nucleus of
the host cell.

Table 2

                                       ELISA (ng/ml)
DNA used in transfection               Fc     Kappa

1.) pdC-Fc-Tat °
2.) pdC-Fc-Tat, LTR-TAR-Kappa 1600
3.) pCEP-Tat, LTR-TAR-Kappa 0 277
4.) LTR-TAR-Kappa ° ^

Example 8. Expression of Rev Immunofusin

A 350 base-pair cDNA fragment encoding Rev was modified to include
a 5' XmaI site and a 3' XhoI site and then ligated 3' of the described
secretion cassette in the pdC expression vector. The sequence of the
cDNA encoding the Rev protein is provided in Ratner et al., 1985,
Nature, 313:277, and is incorporated herein by reference.
Specifically, the 5' end of the cDNA was modified to C CCG GG T CGC ATG
GCA ... (Seq. ID No. 7), where the underlined sequence is the XmaI
site and the ATG in bold is the translation start codon of the Rev
gene. At the 3' end, an XhoI site was introduced immediately
downstream of the translation stop codon by standard PCR techniques.
High level expression was obtained in transiently transfected 293 cells
and stably transfected NS/0 cells. Stable NS/0 clones produced about 3
µg/10^6 cells/day of the Rev immunofusin, which has a molecular weight
of about 50 kD when analyzed on an SDS-gel under reducing conditions.

Example 9. Site-Specific Proteolytic Cleavage of an Immunofusin

An exemplary cleavage of an immunofusin is described below. As
would be apparent to one of ordinary skill in the art, each of the
above described immunofusins could be cleaved from their respective
secretion cassettes using the same method or an analogous method.

A CD26 immunofusin having a lysine residue ("Fc(Lys)-CD26
immunofusin"), introduced by linker-adaptor during construction of the
immunofusin between the Fc region and the CD26 target protein sequence,
was cleaved using trypsin. To cleave the Fc(Lys)-CD26 immunofusin, the
immunofusin was bound on Protein A Sepharose and cleaved at the desired
lysine position by trypsin to release CD26 as follows: Fc(Lys)-CD26
immunofusin bound on Protein A Sepharose was incubated with a 1%
trypsin solution at 37°C for 2 hr. Trypsin inhibitor (Sigma) was then
added to stop any further digestion. The supernatant was then removed
and analyzed on an SDS-gel under reducing conditions. After Coomassie
staining, a band having a molecular weight of 110 kD, which corresponds
to the size of CD26 without the secretion cassette, was obtained. As a
control, CD26 immunofusin without the lysine residue at the junction
of the fusion between the Fc domain and the CD26 target protein ("Fc-
CD26 immunofusin") was bound on Protein A Sepharose and similarly
treated. The CD26 was found not to be released from the secretion
cassette of the Fc-CD26 immunofusin, as was expected, and this also
confirmed the specific cleavage of the immunofusin at the amino acid
lysine which was inserted between the CH3 domain of the Fc region and
the target CD26 protein. As a further control, an identical aliquot of
Fc-CD26 immunofusin which was bound to Protein A Sepharose was boiled
in the protein sample buffer, and SDS-gel analysis of the supernatant
showed a 140 kD band corresponding to the full length CD26 immunofusin
protein monomer.

The results from the gel electrophoresis experiment were confirmed
by DPPIV activity assays of the tryptic digests. Quantitative recovery
of the DPPIV enzymatic activity was obtained in the supernatant when
the Fc(Lys)-CD26 immunofusin bound to Protein A Sepharose was treated
with trypsin. In the parallel experiment with Fc-CD26 immunofusin,
there was no DPPIV activity in the supernatant, because the CD26
protein was not released from the Protein A Sepharose.

Example 10. Expression of OSF-2 Immunofusin

OSF-2 is an 80-kD secretory protein that is involved in the
ossification process. The sequence of the DNA encoding OSF-2 is provided
in Takeshita et al., 1993, Biochem. J. 294:271, and is incorporated
herein by reference. The cDNA encoding the OSF-2 protein with its
signal peptide was cloned into the expression vector pdC. NS/0 cells
were used for stable transfection and 293 cells were used for transient
expression; but in neither case was the OSF-2 protein detected.

The OSF-2 cDNA was then adapted to be expressed as an immunofusin.
At the 3' end, the XbaI site at the translation stop codon was
converted to an XhoI site by linker ligation. At the 5' end the
following linker-adaptor was used:

5' CCG GGT AAA AAC AAT CAT TAT GAC AA
3'      CA TTT TTG TTA GTA ATA CTG TTC TAG

as provided in Seq. ID Nos. 8 and 9. The nucleotides in bold encode
the N-terminus of the mature OSF-2 protein, ending with BglII cohesive
ends. These BglII cohesive ends were ligated to the BglII-XhoI
fragment of the OSF-2 cDNA. The XmaI cohesive ends at the 5' end of
the linker-adaptor (underlined) were ligated to the unique XmaI site in
the immunofusin expression vector.

High level expression was obtained in transiently transfected 293
cells and stably transfected NS/0 cells. Stable NS/0 clones produced
about 5 to 7 µg/ml of a 110 kD protein, when analyzed on an SDS-gel
under reducing conditions. This protein was confirmed to be the OSF-2
immunofusin by Western blotting with an anti-OSF-2 antibody.

It was also found that the expression of OSF-2 as an immunofusin in
a mammalian system was superior to the expression of OSF-2 in the
thioredoxin gene fusion expression system in E. coli (LaVallie et al.,
1993, Biotechnology, 11:187). The thioredoxin gene fusion system was
designed to circumvent the formation of inclusion bodies because fusion
to thioredoxin increases the solubility of many heterologous proteins
produced in the E. coli cytoplasm. To test this system for the
expression of OSF-2, the cDNA encoding the mature OSF-2 was inserted
into the SmaI site of the pTrxFus vector (Invitrogen, San Diego, CA),
thus creating a thioredoxin OSF-2 fusion protein. The supplier's
protocol for the expression of the fusion proteins was followed. The
thioredoxin OSF-2 fusion protein was expressed, and, as a control, the
thioredoxin protein was expressed alone without a fusion partner. The
results showed that although thioredoxin alone could be produced as a
soluble protein at a high level, the thioredoxin OSF-2 fusion protein
was present only in the insoluble fraction. Therefore, in addition to
the lack of post-translational modification in bacterial expression, a
relatively complex mammalian protein such as OSF-2 was not synthesized
as a soluble protein when fused to thioredoxin.

Example 11. Expression of βIG-H3 Immunofusin

βIG-H3, a gene product which is induced by transforming growth
factor-β, is a 66-kD secretory protein that shares sequence homology
with OSF-2. The sequence of cDNA encoding βIG-H3 is provided in
Skonier et al. (1992) DNA and Cell Biology, 11:511, and is incorporated
herein by reference. The cDNA encoding the native βIG-H3 was cloned
into the expression vector pdC; but attempts to obtain stable
transfectants producing βIG-H3 were unsuccessful.

The βIG-H3 cDNA was then adapted to be expressed as an immunofusin.
At the 3' end, the BsmI site downstream of the translation stop codon
was converted to an XhoI site by linker ligation. At the 5' end, the
following linker-adaptor was used:

5' CCG GGT AAA GCC CTG GGC C
3'      CA TTT CGG GAC

(Seq. ID Nos. 10 and 11). The nucleotides in bold encode the N-
terminus of the mature βIG-H3 protein. The linker-adaptor had XmaI
cohesive ends for ligating to the expression vector as described in the
above examples, and ApaI cohesive ends for ligating to the ApaI site at
the 5' end of the cDNA sequence encoding the mature βIG-H3.

High level expression was obtained in transiently transfected 293
cells and stably transfected NS/0 cells. Stable NS/0 clones produced
about 3.5 µg/10^6 cells/day of a 100 kD protein when analyzed on an
SDS-gel under reducing conditions. This protein was confirmed to be the
βIG-H3 immunofusin by Western blotting with anti-βIG-H3 antibody.

Example 12. Expression of the Soluble Form of IgE
Receptor as an Immunofusin

The high affinity IgE receptor alpha subunit (IgE-R), the DNA
sequence of which can be found in Kochan et al. (1988) Nucleic Acids
Res. 16:3584 and is incorporated herein by reference, was constructed
as an immunofusin as follows: An XmaI site was introduced to the 5'
end of the cDNA encoding the mature IgE-R so that the sequence at the
junction of the fusion was C CCG GG T GTC CCT CAG ... (Seq. ID No. 12),
where the XmaI site is underlined and the three codons in bold are the
first three amino acid residues of the mature IgE-R. At the 3' end of
the IgE-R, the cDNA encoding the transmembrane domain and the rest of
the C-terminus was deleted and a translation stop codon was placed
after the last codon of the extracellular domain. The sequence of the
IgE-R immunofusin at the 3' end was thus TAC TGG CTA TAA CTC GAG (Seq.
ID No. 13), where the three codons in bold were the last three amino
acid residues of the extracellular domain of the IgE-R, and they were
followed by a stop codon and an XhoI site (underlined).

The pdC expression vector containing the IgE-R immunofusin was
transfected into 293 cells and NS/0 cells. High levels of expression
(3 to 5 µg/ml) of the IgE-R immunofusin were detected in the cell
culture media by anti-Fc ELISA. SDS-gel analysis under reducing
conditions showed a band of the expected size of 70 kD. The partially
purified protein (on Protein A Sepharose) was shown to bind IgE in an
IgE-R/IgE ELISA.

Example 13. Expression of Fcγ1

Fcγ1 was expressed by itself without a C-terminal target protein.
This was achieved by ligating the following linker (having XmaI and
XhoI cohesive ends)

5' CCG GGT AAA TAG C

3'      CA TTT ATC GAG CT

(Seq. ID Nos. 14 and 15), to the XmaI and XhoI sites of the pdC to
reconstruct the coding region of Fc. High level expression was
detected by anti(Fc) ELISA in the cell culture media of the transiently
transfected 293 cells (5 to 7 μg/ml) and stably transfected NS/0 clones
(5 to 10 μg/ml). SDS-gel analysis under reducing conditions showed an
Fc band of the expected size of 31 kD.

Example 14. Expression of PSMA immunofusin

PSMA, prostate specific membrane antigen, is a type II membrane
protein having a molecular weight of greater than 100 kD. PSMA is an
integral membrane protein, and as such it is an attractive target for
imaging and immunoconjugate delivery. To facilitate the expression of
significant quantities of PSMA, we subcloned the extracellular domain
of PSMA (the soluble form) and expressed this domain of PSMA as an
immunofusin. A portion of the extracellular domain of PSMA, which is
also a soluble form of PSMA, can likewise be produced as an immunofusin.

The cDNA encoding the full length PSMA was cloned from a human
prostate carcinoma cell line LNCaP (Israeli et al. (1993) Cancer Res.,
53:227, which publication is incorporated herein by reference). The
portion of the PSMA cDNA corresponding to the extracellular domain was
adapted to be expressed as an immunofusin by Polymerase Chain Reaction
using the following primers:

N-terminal: 5' AAGCTT AAA TCC TCC AAT GAA GC

C-terminal: 5' CTCGAG TTA GGC TAC TTC ACT CAA AG

(Seq. ID Nos. 16 and 17). The two primers provide the HindIII and the
XhoI sites (underlined) for cloning into the immunofusin expression
vector. In the N-terminal primer, the HindIII site is followed by the
coding sequence of the extracellular domain of PSMA (in bold)
immediately after the transmembrane region. In the C-terminal primer,
the XhoI site is followed by the anticodon of the STOP codon and the
C-terminal coding sequence of PSMA (in bold). The amino acid sequence of
the extracellular domain of PSMA is shown in Seq. ID No. 18.
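
The relationship between the two PSMA primers and the ends of Seq. ID No. 18 can be verified by translation. The following Python sketch is illustrative only, with hypothetical helper names. It translates the coding portion of the N-terminal primer directly, and translates the reverse complement of the C-terminal primer (after removing the XhoI site), since that primer is written as the antisense strand.

```python
# Illustrative check only; helper names are not from the specification.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def reverse_complement(seq):
    """Return the reverse complement of a DNA string written 5'->3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def translate(dna):
    """Translate complete codons from the first base; '*' marks a stop codon."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))

N_PRIMER = "AAGCTTAAATCCTCCAATGAAGC"      # Seq. ID No. 16 (HindIII site first)
C_PRIMER = "CTCGAGTTAGGCTACTTCACTCAAAG"   # Seq. ID No. 17 (XhoI site first)

# Sense primer: drop the 6-base HindIII site and translate what follows.
n_term = translate(N_PRIMER[6:])

# Antisense primer: drop the XhoI site, take the reverse complement, and
# align the frame to the stop codon that now sits at the 3' end.
coding = reverse_complement(C_PRIMER[6:])
c_term = translate(coding[len(coding) % 3:])

print(n_term)  # first residues of the extracellular domain
print(c_term)  # last residues followed by '*'
```

The two results correspond to the first five residues (Lys Ser Ser Asn Glu) and the last five residues (Leu Ser Glu Val Ala) listed in Seq. ID No. 18, the latter followed by the stop codon.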

High level expression was obtained in stably transfected 293 and
Sp2/0 cells. The PSMA immunofusin secreted into the cell culture media
was purified by Protein A Sepharose. Treatment of the immunofusin with
the protease plasmin quantitatively converted the 130-kD Fc-PSMA into
two products: the 100-kD PSMA extracellular domain and the 31-kD Fc.
The Fc was then removed from the solution by adsorption onto Protein A
Sepharose. The soluble PSMA was purified and used to immunize mice.
It is expected that an antibody specific only to PSMA should facilitate
diagnosis and therapy of prostate cancer.

Example 15. Expression of Murine Fc

The Fc region of murine γ2a was prepared for expression as an
immunofusin. Since the murine Fc region will not be immunogenic to
mice, such an immunofusin containing the murine Fc followed by, for
example, a human protein fusion partner can be used to immunize mice
directly without prior cleavage to remove the Fc. The murine Fc
was cloned into our immunofusin expression vector as described below,
and was expressed at a high level under our expression conditions.

The murine Fc γ2a domain, preceded by the signal peptide described
above, was cloned into an expression vector, pdC, and was expressed
without fusion to a target protein. Murine Fc γ2a cDNA (Sikorav et
al., 1980, Nucleic Acids Res., 8:3143-3155, which publication is
incorporated herein by reference) was adapted for cloning into the
expression vector by Polymerase Chain Reaction using the following
primers:

N-terminal: 5' CTTAAG C GAG CCC AGA GGG CCC ACA
C-terminal: 5' CTCGAG C TCA TTT ACC CGG AGT CCG

(Seq. ID Nos. 19 and 20). The N-terminal primer contains an AflII site
(underlined) for ligating to the AflII site at the 3' end of the signal
peptide, described above. The sequence following the AflII site (in
bold) encodes the amino acid residues in the hinge region of murine γ2a
gene. The C-terminal primer contains an XhoI site for cloning into the
expression vector, followed by the anticodons of the translation STOP
codon and the carboxyl end of murine γ2a (in bold).
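
The restriction sites carried by the two murine Fc primers can be located with a simple scan. The following Python sketch is illustrative only: the function name is hypothetical, and the recognition sequences are the standard ones for the enzymes named in the examples (AflII, HindIII, XhoI, XmaI).

```python
# Illustrative check only; the helper name is not from the specification.
# Standard recognition sequences of the enzymes named in the examples.
SITES = {
    "AflII": "CTTAAG",
    "HindIII": "AAGCTT",
    "XhoI": "CTCGAG",
    "XmaI": "CCCGGG",
}

def find_sites(seq):
    """Map each enzyme name to the 0-based positions where its site starts."""
    hits = {}
    for name, site in SITES.items():
        positions = [
            i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)
        ]
        if positions:
            hits[name] = positions
    return hits

N_PRIMER = "CTTAAGCGAGCCCAGAGGGCCCACA"  # Seq. ID No. 19
C_PRIMER = "CTCGAGCTCATTTACCCGGAGTCCG"  # Seq. ID No. 20

print(find_sites(N_PRIMER))
print(find_sites(C_PRIMER))
```

Each primer carries exactly one of the listed sites, at its 5' end. Neither primer contains an XmaI site, so the amplified murine fragment is cloned through AflII and XhoI as described.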

High level expression of the murine Fc γ2a region was demonstrated
in 293 cells by SDS gel analysis followed by Western blotting with an
anti-murine IgG antibody.

Example 16. Expression of gp120

The envelope protein gp120 of human immunodeficiency virus (HIV) is
a glycoprotein having a molecular weight of 120 kD, and is expressed on
the surface of HIV particles and HIV infected cells. The protein gp120
is originally expressed in infected cells as a polyprotein, gp160,
which is then cleaved by a cellular protein to gp120 and gp41. gp120
was prepared as an immunofusin, and it was determined that the gp120
immunofusin was expressed at a very high level. Any desired portion of
gp120 may also be prepared as an immunofusin. The Fc moiety of the gp120
immunofusin could be cleaved and gp120 was purified.

The complete nucleotide sequence of HIV has been published in
Ratner et al. (1985) Nature, 313:277, and this publication is
incorporated herein by reference. To prepare the gp120 immunofusin, a
translation STOP codon followed by an XhoI restriction site was
introduced to the gp120-gp41 junction after amino acid Arg-518 of gp160
using standard molecular biology techniques, e.g., polymerase chain
reaction. The existing NdeI restriction site present at nucleotide
5979, which is within the amino terminal portion of gp120, was
converted to a HindIII restriction site through linker-adaptor ligation
to generate an in-frame fusion. The resultant HindIII-XhoI fragment
(1.36 kilobase pairs) encoding gp120 was then cloned into the
immunofusin expression vector, pdC, as described above.

The gp120 immunofusin expression vector was expressed in stably
transfected 293 cells according to the methods described above, and
high level expression of the gp120 immunofusin was obtained. The gp120
immunofusin was functionally active, as determined by binding to CD4 in
an ELISA. The gp120 immunofusin was also determined to be
quantitatively cleaved by enterokinase to release gp120 and the Fc
region.

Other Embodiments 

The invention may be embodied in other specific forms without 
departing from the spirit or essential characteristics thereof. The 
present embodiments are therefore to be considered in all respects as 
illustrative and non-restrictive, the scope of the invention being 
indicated by the appended claims rather than by the foregoing 
description, and all changes which come within the meaning and range of 
equivalency of the claims are therefore intended to be embraced 
therein . 
SEQUENCE LISTING



(1) GENERAL INFORMATION:



(i) APPLICANT: 

(A) NAME: FUJI IMMUNOPHARMACEUTICALS CORP. 

(B) STREET: 125 HARTWELL AVENUE 

(C) CITY: LEXINGTON 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) POSTAL CODE: 02173

(G) TELEPHONE: (617) 861-5300 

(H) TELEFAX: (617) 861-5301 

(I) TELEX: 

(ii) TITLE OF INVENTION: EXPRESSION AND EXPORT TECHNOLOGY OF 
PROTEINS AS IMMUNOFUSINS 

(iii) NUMBER OF SEQUENCES: 20 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: PATENT ADMINISTRATOR, TESTA, HURWITZ &
THIBEAULT

(B) STREET: 125 HIGH STREET 

(C) CITY: BOSTON 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: PITCHER, EDMUND R.

(B) REGISTRATION NUMBER: 27,829

(C) REFERENCE/DOCKET NUMBER: FIP-001

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-248-7000 

(B) TELEFAX: 617-248-7100 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

GAGAATTCTT AAGCGAGCCC AAATCTTCTG ACAAAACTCA



(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

CCGGGTAAAG GCACAGATGA TGCTACAG 
28 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

CTGTAGCATC ATCTGTGTTT TTAC 
24 

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

CCGGGTTCAG GGGATGACGA TGACGATA
28

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 28 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

AGCTTATCGT CATCGTCATC CCCTGAAC 
28 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

CCCGGGTCGC ATGGAG 
16 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCCGGGTCGC ATGGCA 
16 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

CCGGGTAAAA ACAATCATTA TGACAA
26

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GATCTTGTCA TAATGATTGT TTTTAC 
26 

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

CCGGGTAAAG CCCTGGGCC 
19

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 11 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

CAGGGCTTTA C 
11 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

CCCGGGTGTC CCTCAG
16



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

TACTGGCTAT AACTCGAG
18

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

CCGGGTAAAT AGC 
13 

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

TCGAGCTATT TAC
13

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

AAGCTTAAAT CCTCCAATGA AGC
23



(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

CTCGAGTTAG GCTACTTCAC TCAAAG 
26 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 amino acids 
(B) TYPE: amino acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE:

(A) NAME/KEY: Protein 

(B) LOCATION: 1..707 

(D) OTHER INFORMATION: /note= "EXTRACELLULAR DOMAIN OF 
PSMA" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Lys Ser Ser Asn Glu Ala Thr Asn Ile Thr Pro Lys His Asn Met Lys
1 5 10 15

Ala Phe Leu Asp Glu Leu Lys Ala Glu Asn Ile Lys Lys Phe Leu Tyr
20 25 30

Asn Phe Thr Gln Ile Pro His Leu Ala Gly Thr Glu Gln Asn Phe Gln
35 40 45

Leu Ala Lys Gln Ile Gln Ser Gln Trp Lys Glu Phe Gly Leu Asp Ser
50 55 60

Val Glu Leu Ala His Tyr Asp Val Leu Leu Ser Tyr Pro Asn Lys Thr
65 70 75 80

His Pro Asn Tyr Ile Ser Ile Ile Asn Glu Asp Gly Asn Glu Ile Phe
85 90 95

Asn Thr Ser Leu Phe Glu Pro Pro Pro Pro Gly Tyr Glu Asn Val Ser
100 105 110



Gly Asp Leu Val Tyr Val Asn Tyr Ala Arg Thr Glu Asp Phe Phe Lys
130 135 140

Leu Glu Arg Asp Met Lys Ile Asn Cys Ser Gly Lys Ile Val Ile Ala
145 150 155 160

Arg Tyr Gly Lys Val Phe Arg Gly Asn Lys Val Lys Asn Ala Gln Leu
165 170 175

Ala Gly Ala Lys Gly Val Ile Leu Tyr Ser Asp Pro Ala Asp Tyr Phe
180 185 190

Ala Pro Gly Val Lys Ser Tyr Pro Asp Gly Trp Asn Leu Pro Gly Gly
195 200 205

Gly Val Gln Arg Gly Asn Ile Leu Asn Leu Asn Gly Ala Gly Asp Pro
210 215 220

Leu Thr Pro Gly Tyr Pro Ala Asn Glu Tyr Ala Tyr Arg Arg Gly Ile
225 230 235 240

Ala Glu Ala Val Gly Leu Pro Ser Ile Pro Val His Pro Ile Gly Tyr
245 250 255



, Glu Lys Met Gly Gly Ser Ala Pro Pro 
265 



Asp Ser Ser Trp Arg Gly Ser Leu Lys Val Pro Tyr Asn Val Gly Pro
275 280 285

Gly Phe Thr Gly Asn Phe Ser Thr Gln Lys Val Lys Met His Ile His
290 295 300

Ser Thr Asn Glu Val Thr Arg Ile Tyr Asn Val Ile Gly Thr Leu Arg
305 310 315 320

Gly Ala Val Glu Pro Asp Arg Tyr Val Ile Leu Gly Gly His Arg Asp
325 330 335

Ser Trp Val Phe Gly Gly Ile Asp Pro Gln Ser Gly Ala Ala Val Val
340 345 350

His Glu Ile Val Arg Ser Phe Gly Thr Leu Lys Lys Glu Gly Trp Arg
355 360 365

Pro Arg Arg Thr Ile Leu Phe Ala Ser Trp Asp Ala Glu Glu Phe Gly
370 375 380

Leu Leu Gly Ser Thr Glu Trp Ala Glu Glu Asn Ser Arg Leu Leu Gln
385 390 395 400

Glu Arg Gly Val Ala Tyr Ile Asn Ala Asp Ser Ser Ile Glu Gly Asn
405 410 415

Tyr Thr Leu Arg Val Asp Cys Thr Pro Leu Met Tyr Ser Leu Val His
420 425 430

Asn Leu Thr Lys Glu Leu Lys Ser Pro Asp Glu Gly Phe Glu Gly Lys
435 440 445

Ser Leu Tyr Glu Ser Trp Thr Lys Lys Ser Pro Ser Pro Glu Phe Ser
450 455 460

Gly Met Pro Arg Ile Ser Lys Leu Gly Ser Gly Asn Asp Phe Glu Val
465 470 475 480

Phe Phe Gln Arg Leu Gly Ile Ala Ser Gly Arg Ala Arg Tyr Thr Lys
485 490 495

Asn Trp Glu Thr Asn Lys Phe Ser Gly Tyr Pro Leu Tyr His Ser Val
500 505 510

Tyr Glu Thr Tyr Glu Leu Val Glu Lys Phe Tyr Asp Pro Met Phe Lys
515 520 525

Tyr His Leu Thr Val Ala Gln Val Arg Gly Gly Met Val Phe Glu Leu
530 535 540

Ala Asn Ser Ile Val Leu Pro Phe Asp Cys Arg Asp Tyr Ala Val Val
545 550 555 560

Leu Arg Lys Tyr Ala Asp Lys Ile Tyr Ser Ile Ser Met Lys His Pro
565 570 575

Gln Glu Met Lys Thr Tyr Ser Val Ser Phe Asp Ser Leu Phe Ser Ala
580 585 590

Val Lys Asn Phe Thr Glu Ile Ala Ser Lys Phe Ser Glu Arg Leu Gln
595 600 605

Asp Phe Asp Lys Ser Asn Pro Ile Val Leu Arg Met Met Asn Asp Gln
610 615 620

Leu Met Phe Leu Glu Arg Ala Phe Ile Asp Pro Leu Gly Leu Pro Asp
625 630 635 640

Arg Pro Phe Tyr Arg His Val Ile Tyr Ala Pro Ser Ser His Asn Lys
645 650 655

Tyr Ala Gly Glu Ser Phe Pro Gly Ile Tyr Asp Ala Leu Phe Asp Ile
660 665 670

Glu Ser Lys Val Asp Pro Ser Lys Ala Trp Gly Glu Val Lys Arg Gln
675 680 685

Ile Tyr Val Ala Ala Phe Thr Val Gln Ala Ala Ala Glu Thr Leu Ser
690 695 700

Glu Val Ala
705

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

CTTAAGCGAG CCCAGAGGGC CCACA
25 

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

CTCGAGCTCA TTTACCCGGA GTCCG
25

What is claimed is:

1. A DNA produced by recombinant DNA techniques for inducing expression
and subsequent secretion of a target protein, said sequence being free of
immunoglobulin CH1 and comprising a polynucleotide encoding, from its 5' to
3' direction:
   A) a secretion cassette which comprises
      a signal sequence;
      an immunoglobulin Fc region; and
   B) a target protein sequence.

2. The DNA of claim 1 wherein the target protein sequence encodes a
soluble form of prostate specific membrane antigen.

3. The DNA of claim 1 wherein the target protein sequence encodes at
least a portion of gp120 protein.

4. The DNA of claim 1, 2 or 3 wherein the signal sequence encodes a
signal peptide which directs secretion of the target protein and is then
removed by enzymatic cleavage.

5. The DNA of claim 1, 2 or 3 wherein the Fc region is altered to delete
at least one effector function activity.

6. The DNA of claim 1, 2 or 3 wherein the Fc region comprises a hinge, a
CH2 domain and a CH3 domain of immunoglobulin gamma.

7. The DNA of claim 1, 2 or 3 wherein the Fc region comprises a hinge
region and a CH3 domain of immunoglobulin gamma.

8. The DNA of claim 1, 2 or 3 further comprising a proteolytic cleavage
site interposed 3' of a portion of said polynucleotide encoding said
immunoglobulin Fc region and 5' of a portion of said polynucleotide encoding
said entire target protein.

9. A replicable expression vector for transfecting a mammalian cell, said
vector comprising the DNA of claim 1, 2 or 3.




10. A host cell transformed with the DNA of claim 1, 2 or 3.

11. Target protein produced by culturing the cell of claim 10.

12. A method of producing a target protein comprising the steps of:
   1) transfecting the DNA of claim 1, 2 or 3 into a host cell;
   2) culturing the host cell in a medium under conditions to promote
      expression and secretion of a fusion protein comprising
         an immunoglobulin Fc region, and
         a target protein; and
   3) collecting the fusion protein from the medium.

13. The method of claim 12 wherein the fusion protein has bioactivity of
the target protein.

14. The method of claim 12 further comprising the additional steps, after
step 3, of
   4) cleaving the Fc region from the target protein, and
   5) collecting the target protein.

15. A method of producing a target protein comprising the steps of:
   1) transfecting the DNA of claim 1, 2 or 3 into a host cell;
   2) culturing the host cell in a medium under conditions to promote
      expression and secretion of a fusion protein comprising
         an immunoglobulin Fc region, and
         a target protein; and
   3) cleaving the Fc region from the target protein, and
   4) collecting the target protein from the medium.